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Abstract 

We investigate the two components of the total daily return (close-to-close) , the overnight return 
(close-to-open) and the daytime return (open-to-close), as well as the corresponding volatilities of 
the 2215 NYSE stocks from 1988 to 2007. The tail distribution of the volatility the long-term 
memory in the sequence, and the cross-correlation between different returns are analyzed. Our 
results suggest that: (i) The two component returns and volatilities have similar features as that 
of the total return and volatility. The tail distribution follows a power law for all volatilities, and 
long-term correlations exist in the volatility sequences but not in the return sequences, (ii) The 
daytime return contributes more to the total return. Both the tail distribution and the long-term 
memory of the daytime volatility are more similar to that of the total volatility, compared to the 
overnight records. In addition, the cross-correlation between the daytime return and the total 
return is also stronger, (iii) The two component returns tend to be anti-correlated. Moreover, we 
find that the cross-correlations between the three different returns (total, overnight, and daytime) 
are quite stable over the entire 20-year period. 
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I. INTRODUCTION 



Financial markets are of great importance for economics and econophysics research 
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211 ] . A key topic of the market 
studies is the price dynamics, which could be measured by the price change ( "return" ) and 
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Especially, the volatility has important practical implications. For example, it is the key 
input for option pricing models such as the classic Black-Scholes model and Cox, Ross, and 



Rubinstein binomial models 
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171 ] . Usually financial markets are closed during the night, 



and all news or events in the night are reflected in the opening price of the next trading day. 
A day (from former day closing to current day closing) therefore can be decomposed into 
two sessions, overnight (from former day closing to current day opening) and daytime (from 
current day opening to closing) sessions. The study of the returns and the volatilities during 
these two sessions might provide new insights towards better understanding of the financial 
markets. Practically, this study can help traders to improve trading strategies at the market 
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opening and closing. It also can help investors to analyze the dually-traded equities 

Recently there were some studies on the returns and volatilities over sub-day sessions. 
George and Hwang decomposed the daily return of 200 Japanese stocks and analyzed their 
volatility patterns [18j. Wang et. al. studied 15 stocks which are traded in both Hong 
Kong and London but in different hours 19j. However, there is still lack of a comprehensive 
analysis of the overnight and daytime price change for a leading market such as the New 



York Stock Exchange (NYSE). For the dai 
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ities of stock prices are well studied 



and high-frequency intraday data, returns and 
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These studies show that the return and volatility distribution decay as power 



laws, and the correlations in the returns disappear after few minutes whi 
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the volatility time series can exist upto months and even longer 

HI 

of interest to examine whether these features persist also in the two component returns and 
volatilities. Obviously one can assume that the overnight price change behaves statistically 
different from the daytime change. What are the differences? Furthermore, the influence of 
the overnight price change on the daytime change is also of interest and should be examined. 

In this paper we examine the daily data for all stocks traded in NYSE. First we study 
the fundamental features of the time series, distribution of the records and the correlations 
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in the sequence. Three types of functions, power law, exponential, and power law with 
an exponential cutoff, are tested for the tail of the volatility distribution. We find that the 
power law function fits best for most stocks. Then we analyze the l ong-ter m memory of each 



stock using the detrended fluctuation analysis (DFA) method [271. l28l. l29l. |30||. and find that 
the long-term correlations persist in the volatilities of both components. We show that the 
distribution and the long-term memory of the daytime volatility is more similar to the total 
volatility, compared to the overnight volatility. Further, we study the cross-correlations 
between the three types of returns (total, overnight, and daytime). The two component 
returns are found to be weakly anti-correlated but both overnight and daytime return are 
strongly correlated with the total return. Interestingly, we find that this behavior is quite 
stable during the entire 20-year period. 

II. DATA AND VARIABLES 

We collect the daily opening and closing prices of all securities that are listed in NYSE 



on December 31, 2007, in total 2215 stocks 3jJ. The record starts from January 2, 1962, 



but many stocks have a much shorter history. We do not include the data before 1987 
period for two reasons. First, from 1962 to 1987 there exist only very little data, about 
6.5% of all the data points for these 2215 stocks. Second and more important, there was 
a huge market crash on October 19, 1987 ("Black Monday"), and after that the market 
was adapted in a great extent. Thus, to reduce the complexity of market structure, we 
only examine the data from 1988 to 2007, in total 20 years. The length of the 2215 stocks 
ranges from N = 1000 to 5000 trading days. Note that many stocks have splits in the 
20-year period, which causes significant change in the price. Therefore, we adjust all prices 
according to the historical splits. The 2215 stocks cover all industrial sectors, a wide range 
of the stock market capitalization (from 6 x 10 6 to 5 x 10 11 dollars), and a wide range of the 
average daily volume (from 500 to 2 x 10 7 shares a day). 

Now we define two basic measures, return R and volatility V. The daily return is the 
logarithmic change of the successive daily closing prices ("total return"), 

R T (t) =Hp dose (t)/p dose (t - 1)); (1) 
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the return over the overnight session ("overnight return") is 

R N {t) = \n{v open {t)/ V dose {t-l))- 



and the return over the daytime session ("daytime return") is 



R D (t) = \n(p dose (t)/p°P en (t)). 



(2) 



(3) 



Here p dose (t) is the closing price and p open (t) is the opening price at day t. Note that 
Rxit) = i?Ar(t) + #£>(i), -Rd(^) and -Rjv(£) are in the same day and Rnif) is after R^it). 
Fig. [T] shows the three types of return for a typical stock AA (Alcoa^ 



2007. The volatility is defined as the absolute value of the return 
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from 1988 to 
i.e. 



V = \R\. 



(4) 



Thus, corresponding to the three types of return, we have three types of volatility, the total 
volatility Vr, the overnight volatility Vjv, and the daytime volatility Vd- 

III. TAIL OF VOLATILITY DISTRIBUTION 



The tail distribution accounts for large fluctuations and events which are very important 

for risk analysis. By the definition [Eq. (j3j)], the volatility aggregates both positive and 

negative returns and has better statistics. In addition, the distribution of the return is 

I — I 

approximately symmetric in the two tails [20j]. Therefore we focus on the tail distribution 
of the volatility. As a stylized fact of econophysics research, the cumulative distribution 



function (CDF) J32L0 
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volatilities has a "fat tail" which is usually characterized by a power 



25|, 



P(x) ~ x 



where ( is the tail exponent. A classical approach to fit the tail is using 



(5) 

;he Maximum 



Likelihood Estimator, which is called Hill estimator for a power law tai 
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goodness-of-fit is tested by the Kolmogorov-Smirnov (KS) statistic D [33|, |34j], the maximum 
absolute difference between the cumulative distribution of the measured distribution P(x) 
and that of the fit S(x), i.e., 



D = max(|P(x) — S(x)\), 



(6) 
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TABLE I: Number of good fit of the volatility tail distribution for the 2215 NYSE stocks. Good 
fit refers to the cases where the null hypothesis is not ruled out for 1% significance level. 



Volatility V 


V T 


Vn 


Vd 


Power law 


2066 


1868 


2066 


Exponential 


1693 


644 


1756 


Power law with cutoff 


1755 


1772 


1728 



for all volatility values in the tail 35j . When D is larger than a certain value, which is called 
critical value (CV), the null hypothesis that the distribution follows a power law is rejected. 
The CV is determined by the significance level and data size N. In this paper we choose 
significance level of 1% and the corresponding CV = 1.63/ViV. 

To further test the volatility tail, we also try two other distribution functions in the same 
range and using the same method. One is the exponential distribution function, 



P(x) ~ e 



-x/x* 



(7) 



where x* is a characteristic scale. The other is a power law function with an exponential 
cutoff, 

P{x) ~ ■ e~ x l x * . (8) 

We examine the tail distribution of Vr, Vn and Vd for the 2215 NYSE stocks. The 
number of fit that the null hypothesis was valid under 1% significant level ("good fit") is 
listed in Table [H For the power law distribution, only a small portion (10%) of the three 
types of volatlities are ruled out, which manifests that the tail is well characterized by the 
power law function for the broad market. For the exponential hypothesis, almost half (38%) 
of all cases are ruled out. Moreover, about 98% out of the good exponential fits, the power 
law hypothesis is not ruled out either. As a whole, the exponential function is poor for 
characterizing the tail, compared to the power law function. For the power law with an 
exponential cutoff, the percentage of good fit is 79% over the three volatilities, which is 
slightly lower than that for the power law. Besides, 99% out of them do not reject the power 
law hypothesis either. Therefore, we conclude that the power law is the best among the 
three distributions. 
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In Fig. [21 we plot the CDF of Vr, Vn, and Vd for four typical stocks, namely, Alcoa, Inc. 
(AA), Cambrex Corp. (CBM), Jones Apparel Group, Inc. (JNY), and Marshall & Ilsley 
Corp. (MI). These stocks belong to diverse industrial sectors and their capitalization vary 
in a wide range, from 27 billion dollars for AA to 0.25 billion dollars for MI. As seen in Fig. 
[2J the tails are well fitted by power laws. Interestingly, the tails of Vd almost always decay 
faster than the tails of Vn, and Vr lies between the two component volatilities. Moreover, 
the log- log slope (tail exponent £) of Vr is closer to that of Vd, indicating the daytime 
return contributes more to the total return. To test this finding for the broad market, we 
plot in Fig. [3] the relation between the tail exponent ( of Vr and ( of the two component 
volatilities for the 2215 stocks. Both scatter plots show certain dependence (as shown by the 
solid curves, which are averages over different bins of ( of Vr), but the correlation between 
Vr and Vd is obviously stronger, which is consistent with Fig. [21 For all three types of 
volatilities, ( is distributed in a certain range from 1.5 to 5, and centered around 3. The 
averages of ( are: {() ~ 2.6 for Vn is lower than (() m 3.2 for Vd, while (() ^3.1 for Vr is 
between the two component volatilities and it is slightly smaller than that for Vd- In this 
paper (...) stands for the average over the data set. This behavior suggests that the daytime 
return influences the total return more than the overnight return. 



IV. CORRELATIONS IN RETURNS AND VOLATILITIES 



After analyzing the volatility distribution, a question naturally arises, how these values are 
organized in the time sequence? For the investors, the temporal structure is of special interest 
because it determines how and when to trade. The time organization in a time series can be 
characterized by the two-point correlation. It is known that the total return has only short- 
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term correlations and the total volatility has long-term correlations [20|, 
Now we examine the correlations in each of their two components (overnight and daytime). 

It is well known that financial time series are usually non-stationary. In such cases, the 
conventional methods for correlations such as auto-correlation and spectral analysis have 
spurious effects. To avoid the artifact correlations arising from non-stationarity, we employ 



the DFA method, which is based on the idea that a correlated time series could be mapped to 
a similar process by integration, and re m0 ™»p«cal. y trends in order to detect 



the long-term correlations in the time series [23, m m laoi]. After removing polynomial 
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trends in every equal-size box of i points, DFA computes the root-mean-square fluctuation 
F(£) of a time series and determine the correlation exponent a from the scaling function 



F{£) ~ r, (9) 

where the exponent a G (0, 1), called correlation exponent, characterizes the auto-correlation 
in the sequence. It is uncorrelated if a = 0.5, positively correlated if a > 0.5 and anti- 
correlated if a < 0.5. In Figure HJ we plot DFA curves for the returns and volatilities of the 
total, overnight, and daytime sequences for four typical stocks. The values of a are obtained 
by the power law fit to the fluctuation function, as illustrated by the dashed lines in Fig. 
11(d). For all three types of returns, a is close to 0.5 and therefore there are no long-term 
correlations. For the volatilities, the fluctuation function is more complicated. The slopes 
(in log-log scale) of different regions are significantly different. Thus, we divide the whole 
curve into two equal-size regions in the logarithmic scale and fit them separately, as shown 
by the dashed lines in Fig. 11(d). 

To test the universality of our findings, we plot in Fig. [5] the probability density function 
(PDF, which is the derivative of CDF) of a for the three returns as well as for the short 
and long time scales of the volatilities. For the returns [Fig. [51(a)], the distributions are 
centered around 0.5, a = 0.48 ± 0.04 for the total, a = 0.55 ± 0.05 for the overnight and 
a = 0.52 ± 0.04 for the daytime. Here and in the following, the error bars are the standard 
deviations over all 2215 stocks. These error bars are quite small representing quite narrow 
distributions. This result is consistent with earlier studies, where no long-term correlations 



were found for the returns 20] . For the volatilities at short time scales [Fig. [5(b)], the 
distributions are centered around 0.6, a = 0.63 ± 0.04 for the total [^(J, a = 0.59 ± 0.03 
for the overnight and a = 0.63 ± 0.04 for the day time. For the volatilities at long time 



scales [Fig. [5(c)], a = 0.75 ± 0.10 for the total [20j|, a = 0.71 ± 0.12 for the overnight and 
a = 0.75 ± 0.10 for the daytime. For all time scales, the volatility a values are significantly 
larger than 0.5, suggesting long-term correlations in the volatility sequences. In addition, the 
a values of the long-term scales are systematically larger than that of the short-term scales. 
This multiscaling behavior indicates that the correlation becomes stronger for longer times. 
Moreover, all distributions are relatively narrow for both returns and volatilities, suggesting 
a universal feature over the entire market. We also see that the curves of the total and 
daytime almost collapse onto a single curve, while the curve of the overnight departs away 
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TABLE II: Cross-correlation between the a values of the three types of returns and volatilities 
for the 2215 NYSE stocks. We divide the 2215 stocks into 10 equal-size subsets and calculate the 
cross-correlation for every subset. The error bar is the corresponding standard deviation of the 10 
cross-correlations. The value in the parenthesis is the corresponding cross-correlation between two 
shuffled a records. 



Cross-correlation C 


C(Total, Overnight) 


C (Total, Daytime) 


C(Overnight, Daytime) 


Return 


0.25 ± 0.07 
(-0.00) 


0.51 ±0.08 
(-0.03) 


0.48 ± 0.08 
(0.02) 


Volatility (short time scales) 


0.29 ± 0.09 
(0.04) 


0.80 ±0.04 
(-0.02) 


0.24 ± 0.04 
(0.03) 


Volatility (long time scales) 


0.56 ± 0.06 
(0.01) 


0.90 ±0.02 
(0.02) 


0.52 ±0.05 
(-0.02) 



from them, supporting again that the daytime return contributes more than the overnight 
return to the total return. 

Now we address the question if there is a relation between the correlation exponents a 
of the two components of the return and volatility. If a certain stock has large (small) a for 
one component, does it have also large (small) a in the other component or in the total? 
To test this, we employ the cross-correlation function to quantitatively compare them. The 
cross-correlation (also called the Pearson Coefficient) between variable x and y is 

C(s,„) B <*• f -<',>• fa* ■ (10) 
a{x) ■ a{y) 



Here a stands for the standard deviation, i.e., for variable x, a(x) = a/ (x 2 ) — (x) 2 . For our 
case, x and y are vectors representing the three sequences of a (total, overnight, daytime) 
for the return, short time and long time volatilities for all companies. The companies are in 
the same order for all sequences. As shown in Table HTl all cross-correlations are significantly 
larger than that of shuffled records (values in the parenthesis), suggesting strong relations 
between the different returns or volatilities. Note again that the total-daytime pair is always 
the strongest one, which is in agreement with the assumption that the total return and 
volatility are significantly more influenced by the daytime return and volatility, than by the 
overnight return and volatility. 
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V. RELATION BETWEEN TOTAL, OVERNIGHT AND DAYTIME RETURNS 



The overnight return and the daytime return are the price changes over different sessions 
of a trading day, and they make the total return. It is interesting to examine now if the three 
returns of the same stock are cross-correlated. This will test the question, e.g., how changes 
in the day time are related to those of night time or the total. The cross-correlation function 
[Eq. ([TO]) ] examines the two time series without any time lag. However, there might be some 
time delays between two time series, and therefore we shift the two sequences by time lag 
At to test this possibility. Moreover, the comparison between the cross-correlations with 
different lags allows us to examine the significance of a cross-correlation value. Therefore, 
we use the generalized cross-correlation with the time lag At, i.e., 

r ( , (x(t)-y(t + At))-(x)-(y) 
a{x) ■ a(y) 

between two time series x(t) and y{t). Note that Eq. ffTOj) is the special case of Eq. ffTTl) with 
At = 0. In general one tests the position of the maximum (minimum if it is anti-correlated) 
of C/\t which may occur at At = r and r is called the time delay [36] . Here we find that the 
maximum of C^t is always for At = (as shown in Figure [6]). 

In this paper we use C/\t to test the significance of the cross-correlation at At = 0. If 
C*At=o is significantly different (higher or lower) from Ca^o, the cross-correlation can be 
regarded as reliable. Quantitatively, we use the standard deviation of Ca^o values over 
the range —20 < At < 20, c(Ca^o) ; to test the reliability of the cross-correlation [36]. As 
examples, we plot in Fig. [6] the cross-correlations of three pairs of returns for the four typical 
stocks, AA, CBM, JNY, and MI (other stocks have similar features). For both C(Rt,Rn) 
and C(Rt,Rd), the cross- correlations at At = are more than 10 times higher than their 
c"(Ca^o) so they are very robust. However, for C(Rn, Rd), the cross-correlations vary with 
the stock. Some of them have significant cross-correlation values but some of them are in 
the range of their cr(CAi^o)- Since R^ and Rd covers different periods, there could be some 
strong correlations or almost independent, it is reasonable that the cross-correlation varies in 
a wide range. On the other hand, Rt always shares part of changes with its two component 
returns and deduce strong positive cross-correlations. 

Next we examine the three pairs of cross-correlations Cm=o for all the 2215 stocks (in the 
following, the function C refers to Ca*=o if the At subscript is missing). Their distributions 
are plotted in Fig. [7J For each pair, the cross-correlations are distributed in a certain range. 



The cross-correlation between the total return and the daytime return, C(Rt, Rd) — 0.8±0.1 
(mean value and standard deviation over the 2215 stocks), is always the largest value in 
the three pairs. The cross-correlation between the total and the overnight, C^Rx^Rn) — 
0.4±0.1, is a still high but significantly smaller than C(Rt, Rd) values. The cross-correlation 
between day and night, C(Rn, Rd) = — 0.1±0.1, is distributed around with more tendency 
to have negative values. In summary, the total return is more synchronized with the daytime 
return. It is also interesting to note that there are significantly more stocks that have negative 
correlations between R^ and Rd- For example, 567 out of the 2215 stocks have values of 
C(Rn, Rd) < —0.2. This implies that the probability is relatively high for a large positive 
overnight return to be followed by a large negative daytime return. The overnight return and 
the daytime return tend to be slightly anti-correlated, and the total return usually moves in 
the same direction as the daytime return. 

Due to many factors, such as changes in the regulations or new technologies, the markets 
evolves with time. An interesting question arises, is the cross-correlation stable in the 
sample years studied. To test the stability of the cross- correlations, we recalculate the cross- 
correlations year by year. The records in 1 year are enough to calculate the cross-correlation 
and more importantly, the equity market in such a short period can be assumed stable. In 
Fig. [8] we plot the averages and standard deviations (as error bars) of the cross-correlations 
over the 2215 stocks against the year. For the three types of cross-correlations, the curves 
only slightly vary with the years and all changes lie within the error bars. Moreover, the 
error bars are almost the same for all years, which clearly shows that the cross-correlation 
is quite stable over the 20 years period studied. 



VI. DISCUSSION 



Returns and volatilities mi ght be affected by some factors, such as the market capital- 



ization and the mean volume 



121 ] . To test this for the entire stock market, we investigate 



the relation between the factors, including the capitalization and mean volume, and the 
measures, such as the tail exponent (, the correlation exponent a, and the cross-correlations 
between the three returns. There are some tendencies between these factors and measures. 
However, most of these tendencies are in the range of the error bars, which suggests no 
significant dependence between the two factors and three measurements. The behavior of 
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the three measures is quite universal over the entire market. To better understand the com- 
plexity of the equity market, the connection between different measurements and factors of 
stocks might need to be further analyzed. 

In summary, we examined the distributions of the total, overnight and daytime volatility. 
Compared to the exponential and power law with cutoff, power law distribution is found 
to be mostly better. The tail exponent ( is distributed among the different stocks between 
1.5 and 5 for the three types of volatility. We also analyzed the correlations in returns 
and volatilities of the components, using the DFA method. For both returns, there are no 
long-term correlations. However, for both volatilities, there are long-term correlations in 
all time scales and the correlations are even stronger in the long time scales. For the tail 
distribution and for the long-term correlations, the results of the two component returns 
and volatilities are similar to the total return and volatility. Moreover, the records of the 
daytime are more similar to the total of the same stock, suggesting that the daytime return 
contributes more to the total return. To better compare these similarities, we studied also 
the cross-correlations between the different types of return and found consistent behaviors, 
i.e., the daytime is more correlated to the total compared to the night time. Further, the 
cross-correlation between the overnight return and the daytime return varies for different 
stocks, and interestingly, a significant fraction of the 2215 stocks is far below 0. This finding 
suggests that the daytime return has a considerable probability to strongly anti-correlate 
with the overnight return. Furthermore, we examined the cross-correlations year by year 
and found that the behavior is quite stable over the 20-year period. 
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FIG. 1: (Color online) Illustration of the return time series. Three types of return, (a) the total 
return Rt, (b) the overnight return Rpj, and (c) the daytime return Rp of a typical stock, AA, 
are shown. We can see that the fluctuations of i?/v are relatively weaker, and the curve of Rd is 
more similar to that of Rt- 
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FIG. 2: (Color online) Typical cumulative distribution of the volatilities and power law fit to the 
tails. For four typical stocks, (a) AA, (b) CBM, (c) JNY, and (d) MI, three types of volatility, total 
volatility Vr (circles), overnight volatility Vjy (squares), and daytime volatility Vn (triangles) are 
demonstrated. The dashed lines are power law fits to the distribution tails. Note that the curves 
for Vt (circles) almost coincide with those of Vd and thus they are vertically shifted for better 
visibility. 
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FIG. 3: (Color online) Relation between the tail exponent £ for the total volatility Vr and that 
for the two component volatilities, (a) the overnight volatility Vjv and (b) the daytime volatility 
Vd- A point represents a stock which has good power law fit to the tail for the corresponding two 
types of volatilities. 1812 out of the 2215 NYSE stocks are exhibited in panel (a) and 2001 stocks 
are exhibited in panel (b). To show the tendency, we divide the entire data set into equal-width 
subsets according the value of £ for Vp and calculate the mean values and standard deviations 
in these subsets, as shown by the triangles and the error bars respectively. Both cases clearly 
show tendencies but that for the daytime volatility is stronger, indicating Vr is more influenced by 
Vd- Moreover, £ for all three types of volatilities are distributed in a relatively narrow range and 
centered around 3. 
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FIG. 4: (Color online) DFA fluctuation function F vs. windows size I for the returns (filled 
symbols) and the volatilities (open symbols). The four panels are for stocks AA, CBM, JNY, and 
MI respectively. For each case, three types of data, total (circles), overnight (squares), and daytime 
(triangles) are shown. Note that the curves are vertically shifted for better visibility. To obtain 
the correlation exponent a, we fit all curves with power laws, as illustrated by the dashed lines in 
panel (d). For the volatility, the exponent a is significantly different for short and long time scales, 
thus we split the entire range into two regimes and fit them separately. 
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FIG. 5: (Color online) Distribution of the correlation exponent a. for (a) the returns, (b) the 
volatilities of the short time scales, and (c) the volatilities of the long time scales. Three types of 
returns and volatilities, total (circles), overnight (squares), and daytime (triangles) are shown. All 
distributions approximately follow the normal distribution. For example, a normal distribution fit 
on the overnight return is shown by the dashed line in panel (a). For the volatilities, the curves 
for the total and the daytime almost collapse into a single curve in panels (b) and (c), suggesting 
that the total return is more influenced by the daytime return. 
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FIG. 6: (Color online) Reliability of the cross-correlations between the three types of returns, 
the total return Rt, the overnight return Rpf, and the daytime return Rq. For all four stocks 
(a) AA, (b) CBM, (c) JNY, and (d) MI, the two cross-correlations with respect to the total 
return are significant larger than their cross-correlations with the time lags, which suggests both 
component returns are strongly positively correlated to the total return. However, the cross- 
correlation between the two component returns varies with the stocks, e.g., it is positive for AA 
and negative for MI. 
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FIG. 7: (Color online) Distribution of the cross-correlations C between the returns Rt, Rn, and Re> 
for the 2215 NYSE stocks. Both cross-correlations with respect to the total return are significantly 
larger than and that for the daytime return is stronger, suggesting that the total return is more 
correlated to the daytime return. The cross-correlation between the two component returns is 
relatively more distributed towards negative values, indicating the two component returns tend to 
be anti-correlated. 
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FIG. 8: (Color online) Evolution of the cross-correlations between the three returns, Rt, Rn, and 
Rd, from 1988 to 2007. Here a point represents the average over the cross-correlations of the 2215 
NYSE stocks in a 1-year period, and the error bar is the corresponding standard deviation. Clearly, 
there are no significant changes for the cross-correlations over the 20 years studied. 
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